Image and Audio Annotation: Approximate Inference in Dense Conditional Random Fields

نویسندگان

  • Andrew C. Miller
  • Erik B. Sudderth
چکیده

In the task of image and audio annotation, it is typical for labels to be assigned independently. While some correlations may be modeled, it is rare for all interdependencies between labels to be considered. This is because exact inference on a dense, cyclical graphical model is often intractable. This paper applies an approximate inference method, log-determinant relaxation (LDR), to a fully connected conditional random field (CRF) to perform automatic image and audio annotation. The proposed model estimates the conditional distribution of all labels in a vocabulary given the audio or image features. Two other discriminative models incorporating a varying degree of context are used as a baseline for comparison. We show that introducing context to CRFs improves annotation performance, and can be made tractable with LDR. At the time of writing, the application of LDR to a discriminative model is novel.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Continuous Conditional Random Fields for Efficient Regression in Large Fully Connected Graphs

When used for structured regression, powerful Conditional Random Fields (CRFs) are typically restricted to modeling effects of interactions among examples in local neighborhoods. Using more expressive representation would result in dense graphs, making these methods impractical for large-scale applications. To address this issue, we propose an effective CRF model with linear scale-up properties...

متن کامل

Fuzzy Neighbor Voting for Automatic Image Annotation

With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

متن کامل

Amortized Inference and Learning in Latent Conditional Random Fields for Weakly-Supervised Semantic Image Segmentation

Conditional random fields (CRFs) are commonly employed as a post-processing tool for image segmentation tasks. The unary potentials of the CRF are often learnt independently by a classifier, thereby decoupling the inference in CRF from the training of classifier. Such a scheme works effectively, when pixel-level labelling is available for all the images. However, in absence of pixel-level label...

متن کامل

A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields

This paper presents a method for categorizing named entities in Wikipedia. In Wikipedia, an anchor text is glossed in a linked HTML text. We formalize named entity categorization as a task of categorizing anchor texts with linked HTML texts which glosses a named entity. Using this representation, we introduce a graph structure in which anchor texts are regarded as nodes. In order to incorporate...

متن کامل

Improved Initialisation and Gaussian Mixture Pairwise Terms for Dense Rnadom Fields with Mean-field Inference

Many labelling problems in computer vision are often modelled as discrete optimisation problems such as object class segmentation, stereo correspondence, image de-noising etc. Generally, these problems are solved in a markov random field (MRF) or conditional random field (CRF) framework, where the basic model includes pairwise terms defined over a grid with 4 or 8 neighbours. A more expressive ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010